Pesquisa | BVS - MINISTÉRIO DA SAÚDE

1.

JBrowse 2: a modular genome browser with views of synteny and structural variation.

Diesh, Colin; Stevens, Garrett J; Xie, Peter; De Jesus Martinez, Teresa; Hershberg, Elliot A; Leung, Angel; Guo, Emma; Dider, Shihab; Zhang, Junjun; Bridge, Caroline; Hogue, Gregory; Duncan, Andrew; Morgan, Matthew; Flores, Tia; Bimber, Benjamin N; Haw, Robin; Cain, Scott; Buels, Robert M; Stein, Lincoln D; Holmes, Ian H.

Genome Biol ; 24(1): 74, 2023 04 17.

Artigo em Inglês | MEDLINE | ID: mdl-37069644

RESUMO

We present JBrowse 2, a general-purpose genome annotation browser offering enhanced visualization of complex structural variation and evolutionary relationships. It retains core features of JBrowse while adding new views for synteny, dotplots, breakpoints, gene fusions, and whole-genome overviews. It allows users to share sessions, open multiple genomes, and navigate between views. It can be embedded in a web page, used as a standalone application, or run from Jupyter notebooks or R sessions. These improvements are enabled by a ground-up redesign using modern web technology. We describe application functionality, use cases, performance benchmarks, and implementation notes for web administrators and developers.

Assuntos

Genômica , Software , Sintenia , Genoma , Evolução Biológica , Navegador , Internet

2.

JBrowse Jupyter: a Python interface to JBrowse 2.

De Jesus Martinez, Teresa; Hershberg, Elliot A; Guo, Emma; Stevens, Garrett J; Diesh, Colin; Xie, Peter; Bridge, Caroline; Cain, Scott; Haw, Robin; Buels, Robert M; Stein, Lincoln D; Holmes, Ian H.

Bioinformatics ; 39(1)2023 01 01.

Artigo em Inglês | MEDLINE | ID: mdl-36648320

RESUMO

MOTIVATION: JBrowse Jupyter is a package that aims to close the gap between Python programming and genomic visualization. Web-based genome browsers are routinely used for publishing and inspecting genome annotations. Historically they have been deployed at the end of bioinformatics pipelines, typically decoupled from the analysis itself. However, emerging technologies such as Jupyter notebooks enable a more rapid iterative cycle of development, analysis and visualization. RESULTS: We have developed a package that provides a Python interface to JBrowse 2's suite of embeddable components, including the primary Linear Genome View. The package enables users to quickly set up, launch and customize JBrowse views from Jupyter notebooks. In addition, users can share their data via Google's Colab notebooks, providing reproducible interactive views. AVAILABILITY AND IMPLEMENTATION: JBrowse Jupyter is released under the Apache License and is available for download on PyPI. Source code and demos are available on GitHub at https://github.com/GMOD/jbrowse-jupyter.

Assuntos

Biologia Computacional , Genômica , Software , Genoma , Navegador

3.

JBrowseR: an R interface to the JBrowse 2 genome browser.

Hershberg, Elliot A; Stevens, Garrett; Diesh, Colin; Xie, Peter; De Jesus Martinez, Teresa; Buels, Robert; Stein, Lincoln; Holmes, Ian.

Bioinformatics ; 37(21): 3914-3915, 2021 11 05.

Artigo em Inglês | MEDLINE | ID: mdl-34196689

RESUMO

MOTIVATION: Genome browsers are an essential tool in genome analysis. Modern genome browsers enable complex and interactive visualization of a wide variety of genomic data modalities. While such browsers are very powerful, they can be challenging to configure and program for bioinformaticians lacking expertise in web development. RESULTS: We have developed an R package that provides an interface to the JBrowse 2 genome browser. The package can be used to configure and customize the browser entirely with R code. The browser can be deployed from the R console, or embedded in Shiny applications or R Markdown documents. AVAILABILITY AND IMPLEMENTATION: JBrowseR is available for download from CRAN, and the source code is openly available from the Github repository at https://github.com/GMOD/JBrowseR/.

Assuntos

Genoma , Genômica , Software

4.

JBrowse Connect: A server API to connect JBrowse instances and users.

Yao, Eric; Buels, Robert; Stein, Lincoln; Sen, Taner Z; Holmes, Ian.

PLoS Comput Biol ; 16(8): e1007261, 2020 08.

Artigo em Inglês | MEDLINE | ID: mdl-32810130

RESUMO

We describe JBrowse Connect, an optional expansion to the JBrowse genome browser, targeted at developers. JBrowse Connect allows live messaging, notifications for new annotation tracks, heavy-duty analyses initiated by the user from within the browser, and other dynamic features. We present example applications of JBrowse Connect that allow users 1) to specify and execute BLAST searches by either running on the same host as the webserver, with a self-contained BLAST module leveraging NCBI Blast+ commands, or via a managed Galaxy instance that can optionally run on a different host, and 2) to run the primer design service Primer3. JBrowse Connect allows users to track job progress and view results in the context of the browser. The software is available under a choice of open source licenses including LGPL and the Artistic License.

Assuntos

Bases de Dados Genéticas , Genômica/métodos , Software , Internet

5.

Cram-JS: reference-based decompression in node and the browser.

Buels, Robert; Dider, Shihab; Diesh, Colin; Robinson, James; Holmes, Ian.

Bioinformatics ; 35(21): 4451-4452, 2019 11 01.

Artigo em Inglês | MEDLINE | ID: mdl-31099383

RESUMO

MOTIVATION: The CRAM format addresses rising DNA storage costs for short-read sequencing by aligning reads to a reference genome and encoding the resulting alignment with Huffman, subexponential, Elias gamma, rANS, gzip and other codes. The CRAM codec is complex, and until now, there has been no JavaScript implementation. RESULTS: We have developed a JavaScript library, Cram-JS, that natively reads and decompresses the CRAM format on-the-fly. The library is used in the JBrowse and IGV-JS genome browsers and can readily be used by other JavaScript applications, in the web browser or in Node. AVAILABILITY AND IMPLEMENTATION: Cram-JS is written to the ES-6 standard and is available from the GitHub repository at https://github.com/GMOD/cram-js.

Assuntos

Genoma , Software , DNA , Navegador

6.

FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation.

Bolleman, Jerven T; Mungall, Christopher J; Strozzi, Francesco; Baran, Joachim; Dumontier, Michel; Bonnal, Raoul J P; Buels, Robert; Hoehndorf, Robert; Fujisawa, Takatomo; Katayama, Toshiaki; Cock, Peter J A.

J Biomed Semantics ; 7: 39, 2016 Jun 13.

Artigo em Inglês | MEDLINE | ID: mdl-27296299

RESUMO

BACKGROUND: Nucleotide and protein sequence feature annotations are essential to understand biology on the genomic, transcriptomic, and proteomic level. Using Semantic Web technologies to query biological annotations, there was no standard that described this potentially complex location information as subject-predicate-object triples. DESCRIPTION: We have developed an ontology, the Feature Annotation Location Description Ontology (FALDO), to describe the positions of annotated features on linear and circular sequences. FALDO can be used to describe nucleotide features in sequence records, protein annotations, and glycan binding sites, among other features in coordinate systems of the aforementioned "omics" areas. Using the same data format to represent sequence positions that are independent of file formats allows us to integrate sequence data from multiple sources and data types. The genome browser JBrowse is used to demonstrate accessing multiple SPARQL endpoints to display genomic feature annotations, as well as protein annotations from UniProt mapped to genomic locations. CONCLUSIONS: Our ontology allows users to uniformly describe - and potentially merge - sequence annotations from multiple sources. Data sources using FALDO can prospectively be retrieved using federalised SPARQL queries against public SPARQL endpoints and/or local private triple stores.

Assuntos

Ontologias Biológicas , Anotação de Sequência Molecular/normas , Nucleotídeos/genética , Nucleotídeos/metabolismo , Proteínas/química , Proteínas/metabolismo , Semântica , Bases de Dados Genéticas , Bases de Dados de Proteínas , Lógica Fuzzy , Humanos , Obras de Referência

7.

JBrowse: a dynamic web platform for genome visualization and analysis.

Buels, Robert; Yao, Eric; Diesh, Colin M; Hayes, Richard D; Munoz-Torres, Monica; Helt, Gregg; Goodstein, David M; Elsik, Christine G; Lewis, Suzanna E; Stein, Lincoln; Holmes, Ian H.

Genome Biol ; 17: 66, 2016 Apr 12.

Artigo em Inglês | MEDLINE | ID: mdl-27072794

RESUMO

BACKGROUND: JBrowse is a fast and full-featured genome browser built with JavaScript and HTML5. It is easily embedded into websites or apps but can also be served as a standalone web page. RESULTS: Overall improvements to speed and scalability are accompanied by specific enhancements that support complex interactive queries on large track sets. Analysis functions can readily be added using the plugin framework; most visual aspects of tracks can also be customized, along with clicks, mouseovers, menus, and popup boxes. JBrowse can also be used to browse local annotation files offline and to generate high-resolution figures for publication. CONCLUSIONS: JBrowse is a mature web application suitable for genome visualization and analysis.

Assuntos

Genômica/métodos , Bases de Dados Genéticas , Genoma , Interface Usuário-Computador , Navegador

8.

Web Apollo: a web-based genomic annotation editing platform.

Lee, Eduardo; Helt, Gregg A; Reese, Justin T; Munoz-Torres, Monica C; Childers, Chris P; Buels, Robert M; Stein, Lincoln; Holmes, Ian H; Elsik, Christine G; Lewis, Suzanna E.

Genome Biol ; 14(8): R93, 2013 Aug 30.

Artigo em Inglês | MEDLINE | ID: mdl-24000942

RESUMO

Web Apollo is the first instantaneous, collaborative genomic annotation editor available on the web. One of the natural consequences following from current advances in sequencing technology is that there are more and more researchers sequencing new genomes. These researchers require tools to describe the functional features of their newly sequenced genomes. With Web Apollo researchers can use any of the common browsers (for example, Chrome or Firefox) to jointly analyze and precisely describe the features of a genome in real time, whether they are in the same room or working from opposite sides of the world.

Assuntos

Mapeamento Cromossômico/estatística & dados numéricos , Genoma , Anotação de Sequência Molecular/estatística & dados numéricos , Software , Animais , Aves , Bovinos , Bases de Dados Genéticas , Genômica , Insetos , Internet , Plantas , Análise de Sequência de DNA

9.

The Chado Natural Diversity module: a new generic database schema for large-scale phenotyping and genotyping data.

Jung, Sook; Menda, Naama; Redmond, Seth; Buels, Robert M; Friesen, Maren; Bendana, Yuri; Sanderson, Lacey-Anne; Lapp, Hilmar; Lee, Taein; MacCallum, Bob; Bett, Kirstin E; Cain, Scott; Clements, Dave; Mueller, Lukas A; Main, Dorrie.

Database (Oxford) ; 2011: bar051, 2011.

Artigo em Inglês | MEDLINE | ID: mdl-22120662

RESUMO

Linking phenotypic with genotypic diversity has become a major requirement for basic and applied genome-centric biological research. To meet this need, a comprehensive database backend for efficiently storing, querying and analyzing large experimental data sets is necessary. Chado, a generic, modular, community-based database schema is widely used in the biological community to store information associated with genome sequence data. To meet the need to also accommodate large-scale phenotyping and genotyping projects, a new Chado module called Natural Diversity has been developed. The module strictly adheres to the Chado remit of being generic and ontology driven. The flexibility of the new module is demonstrated in its capacity to store any type of experiment that either uses or generates specimens or stock organisms. Experiments may be grouped or structured hierarchically, whereas any kind of biological entity can be stored as the observed unit, from a specimen to be used in genotyping or phenotyping experiments, to a group of species collected in the field that will undergo further lab analysis. We describe details of the Natural Diversity module, including the design approach, the relational schema and use cases implemented in several databases.

Assuntos

Biodiversidade , Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Animais , Genótipo , Internet , Fenótipo , Plantas

10.

The Sol Genomics Network (solgenomics.net): growing tomatoes using Perl.

Bombarely, Aureliano; Menda, Naama; Tecle, Isaak Y; Buels, Robert M; Strickler, Susan; Fischer-York, Thomas; Pujar, Anuradha; Leto, Jonathan; Gosselin, Joseph; Mueller, Lukas A.

Nucleic Acids Res ; 39(Database issue): D1149-55, 2011 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-20935049

RESUMO

The Sol Genomics Network (SGN; http://solgenomics.net/) is a clade-oriented database (COD) containing biological data for species in the Solanaceae and their close relatives, with data types ranging from chromosomes and genes to phenotypes and accessions. SGN hosts several genome maps and sequences, including a pre-release of the tomato (Solanum lycopersicum cv Heinz 1706) reference genome. A new transcriptome component has been added to store RNA-seq and microarray data. SGN is also an open source software project, continuously developing and improving a complex system for storing, integrating and analyzing data. All code and development work is publicly visible on GitHub (http://github.com). The database architecture combines SGN-specific schemas and the community-developed Chado schema (http://gmod.org/wiki/Chado) for compatibility with other genome databases. The SGN curation model is community-driven, allowing researchers to add and edit information using simple web tools. Currently, over a hundred community annotators help curate the database. SGN can be accessed at http://solgenomics.net/.

Assuntos

Bases de Dados Genéticas , Genoma de Planta , Solanum lycopersicum/genética , Perfilação da Expressão Gênica , Genômica , Solanum lycopersicum/crescimento & desenvolvimento , Solanum lycopersicum/metabolismo , Proteínas de Plantas/genética , Software

11.

solQTL: a tool for QTL analysis, visualization and linking to genomes at SGN database.

Tecle, Isaak Y; Menda, Naama; Buels, Robert M; van der Knaap, Esther; Mueller, Lukas A.

BMC Bioinformatics ; 11: 525, 2010 Oct 21.

Artigo em Inglês | MEDLINE | ID: mdl-20964836

RESUMO

BACKGROUND: A common approach to understanding the genetic basis of complex traits is through identification of associated quantitative trait loci (QTL). Fine mapping QTLs requires several generations of backcrosses and analysis of large populations, which is time-consuming and costly effort. Furthermore, as entire genomes are being sequenced and an increasing amount of genetic and expression data are being generated, a challenge remains: linking phenotypic variation to the underlying genomic variation. To identify candidate genes and understand the molecular basis underlying the phenotypic variation of traits, bioinformatic approaches are needed to exploit information such as genetic map, expression and whole genome sequence data of organisms in biological databases. DESCRIPTION: The Sol Genomics Network (SGN, http://solgenomics.net) is a primary repository for phenotypic, genetic, genomic, expression and metabolic data for the Solanaceae family and other related Asterids species and houses a variety of bioinformatics tools. SGN has implemented a new approach to QTL data organization, storage, analysis, and cross-links with other relevant data in internal and external databases. The new QTL module, solQTL, http://solgenomics.net/qtl/, employs a user-friendly web interface for uploading raw phenotype and genotype data to the database, R/QTL mapping software for on-the-fly QTL analysis and algorithms for online visualization and cross-referencing of QTLs to relevant datasets and tools such as the SGN Comparative Map Viewer and Genome Browser. Here, we describe the development of the solQTL module and demonstrate its application. CONCLUSIONS: solQTL allows Solanaceae researchers to upload raw genotype and phenotype data to SGN, perform QTL analysis and dynamically cross-link to relevant genetic, expression and genome annotations. Exploration and synthesis of the relevant data is expected to help facilitate identification of candidate genes underlying phenotypic variation and markers more closely linked to QTLs. solQTL is freely available on SGN and can be used in private or public mode.

Assuntos

Genoma de Planta , Genômica/métodos , Locos de Características Quantitativas/genética , Software , Algoritmos , Bases de Dados Factuais , Bases de Dados Genéticas , Fenótipo , Solanaceae/genética

12.

A community-based annotation framework for linking solanaceae genomes with phenomes.

Menda, Naama; Buels, Robert M; Tecle, Isaak; Mueller, Lukas A.

Plant Physiol ; 147(4): 1788-99, 2008 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-18539779

RESUMO

The amount of biological data available in the public domain is growing exponentially, and there is an increasing need for infrastructural and human resources to organize, store, and present the data in a proper context. Model organism databases (MODs) invest great efforts to functionally annotate genomes and phenomes by in-house curators. The SOL Genomics Network (SGN; http://www.sgn.cornell.edu) is a clade-oriented database (COD), which provides a more scalable and comparative framework for biological information. SGN has recently spearheaded a new approach by developing community annotation tools to expand its curational capacity. These tools effectively allow some curation to be delegated to qualified researchers, while, at the same time, preserving the in-house curators' full editorial control. Here we describe the background, features, implementation, results, and development road map of SGN's community annotation tools for curating genotypes and phenotypes. Since the inception of this project in late 2006, interest and participation from the Solanaceae research community has been strong and growing continuously to the extent that we plan to expand the framework to accommodate more plant taxa. All data, tools, and code developed at SGN are freely available to download and adapt.

Assuntos

Bases de Dados Genéticas , Genoma de Planta , Fenótipo , Solanaceae/genética , Interface Usuário-Computador

13.

Comparative BAC end sequence analysis of tomato and potato reveals overrepresentation of specific gene families in potato.

Datema, Erwin; Mueller, Lukas A; Buels, Robert; Giovannoni, James J; Visser, Richard G F; Stiekema, Willem J; van Ham, Roeland C G J.

BMC Plant Biol ; 8: 34, 2008 Apr 11.

Artigo em Inglês | MEDLINE | ID: mdl-18405374

RESUMO

BACKGROUND: Tomato (Solanum lycopersicon) and potato (S. tuberosum) are two economically important crop species, the genomes of which are currently being sequenced. This study presents a first genome-wide analysis of these two species, based on two large collections of BAC end sequences representing approximately 19% of the tomato genome and 10% of the potato genome. RESULTS: The tomato genome has a higher repeat content than the potato genome, primarily due to a higher number of retrotransposon insertions in the tomato genome. On the other hand, simple sequence repeats are more abundant in potato than in tomato. The two genomes also differ in the frequency distribution of SSR motifs. Based on EST and protein alignments, potato appears to contain up to 6,400 more putative coding regions than tomato. Major gene families such as cytochrome P450 mono-oxygenases and serine-threonine protein kinases are significantly overrepresented in potato, compared to tomato. Moreover, the P450 superfamily appears to have expanded spectacularly in both species compared to Arabidopsis thaliana, suggesting an expanded network of secondary metabolic pathways in the Solanaceae. Both tomato and potato appear to have a low level of microsynteny with A. thaliana. A higher degree of synteny was observed with Populus trichocarpa, specifically in the region between 15.2 and 19.4 Mb on P. trichocarpa chromosome 10. CONCLUSION: The findings in this paper present a first glimpse into the evolution of Solanaceous genomes, both within the family and relative to other plant species. When the complete genome sequences of these species become available, whole-genome comparisons and protein- or repeat-family specific studies may shed more light on the observations made here.

Assuntos

Genoma de Planta , Análise de Sequência de DNA/métodos , Solanum lycopersicum/genética , Solanum tuberosum/genética , Mapeamento Cromossômico , Etiquetas de Sequências Expressas , Genes de Plantas , Proteínas de Plantas/genética

14.

The SGN comparative map viewer.

Mueller, Lukas A; Mills, Adri A; Skwarecki, Beth; Buels, Robert M; Menda, Naama; Tanksley, Steven D.

Bioinformatics ; 24(3): 422-3, 2008 Feb 01.

Artigo em Inglês | MEDLINE | ID: mdl-18202028

RESUMO

MOTIVATION: With the rapid accumulation of genetic data for a multitude of different species, the availability of intuitive comparative genomic tools becomes an important requirement for the research community. Here we describe a web-based comparative viewer for mapping data, including genetic, physical and cytological maps, that is part of the SGN website (http://sgn.cornell.edu/) but that can also be installed and adapted for other websites. In addition to viewing and comparing different maps stored in the SGN database, the viewer allows users to upload their own maps and compare them to other maps in the system. The viewer is implemented in object oriented Perl, with a simple extensible interface to write data adapters for other relational database schemas and flat file formats.

Assuntos

Mapeamento Cromossômico/métodos , Gráficos por Computador , Locos de Características Quantitativas/genética , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Software , Interface Usuário-Computador , Algoritmos , Evolução Molecular

15.

The SOL Genomics Network: a comparative resource for Solanaceae biology and beyond.

Mueller, Lukas A; Solow, Teri H; Taylor, Nicolas; Skwarecki, Beth; Buels, Robert; Binns, John; Lin, Chenwei; Wright, Mark H; Ahrens, Robert; Wang, Ying; Herbst, Evan V; Keyder, Emil R; Menda, Naama; Zamir, Dani; Tanksley, Steven D.

Plant Physiol ; 138(3): 1310-7, 2005 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-16010005

RESUMO

The SOL Genomics Network (SGN; http://sgn.cornell.edu) is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such as potato (Solanum tuberosum), eggplant (Solanum melongena), pepper (Capsicum annuum), and tomato (Solanum lycopersicum). The aim of SGN is to relate these species to one another using a comparative genomics approach and to tie them to the other dicots through the fully sequenced genome of Arabidopsis (Arabidopsis thaliana). SGN currently houses map and marker data for Solanaceae species, a large expressed sequence tag collection with computationally derived unigene sets, an extensive database of phenotypic information for a mutagenized tomato population, and associated tools such as real-time quantitative trait loci. Recently, the International Solanaceae Project (SOL) was formed as an umbrella organization for Solanaceae research in over 30 countries to address important questions in plant biology. The first cornerstone of the SOL project is the sequencing of the entire euchromatic portion of the tomato genome. SGN is collaborating with other bioinformatics centers in building the bioinformatics infrastructure for the tomato sequencing project and implementing the bioinformatics strategy of the larger SOL project. The overarching goal of SGN is to make information available in an intuitive comparative format, thereby facilitating a systems approach to investigations into the basis of adaptation and phenotypic diversity in the Solanaceae family, other species in the Asterid clade such as coffee (Coffea arabica), Rubiaciae, and beyond.

Assuntos

Bases de Dados de Ácidos Nucleicos , Genoma de Planta , Solanaceae/genética , Arabidopsis/genética , Capsicum/genética , Biologia Computacional/métodos , DNA de Plantas/genética , Armazenamento e Recuperação da Informação , Solanum lycopersicum/genética , Solanum melongena/genética , Solanum tuberosum/genética

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA